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REMOTE AUTHENTICATION BASED ON EXCHANGING 
SIGNALS REPRESENTING BIOMETRICS INFORMATION 

FIELD OF THE INVENTION 

The present invention relates to the field of secure communications and in particular 
to techniques of remote authentication of, and secret key establishment between, 
communicating parties to protect or secure communications over insecure channels. 

BACKGROUND 

Individuals or computer systems often need to have authenticated and confidential 
communications over an open channels, such as the Internet. While such secure 
communications may be achieved by physical means, it is more cost effective and 
flexible to use cryptographic means. 

To have secure communications using cryptographic means, parties need to first 
execute a protocol to authenticate each other and at the same time establish a mutually 
agreed conventional up on secret key, which is then used to encrypt subsequent 
communications between the parties. Conventional authentication and key exchange 
protocols normally require that, the parties either share a secret (e.g., a password) or 
know each other's public keys. 

A cryptographic system, or cryptosystem, uses an encryption key to convert plaintext 
into ciphertext (an unintelligible or undecipherable form of the original information) 
and a decryption key to recover the plaintext from ciphertext. If the encryption key 
and the decryption key are identical, the cryptosystem is referred to as symmetric key 
cryptosystem. If the encryption and decryption keys are different and it is 
computationally infeasible to determine the decryption key from the encryption key, 
the cryptosystem is referred to as an asymmetric key cryptosystem or public key 
cryptosystem. In a public key cryptosystem, anyone can encrypt a message using a 
public encryption key. However, only the holder of a corresponding private 
decryption key can decrypt the ciphertext and recover the message. 
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In a public key cryptosystem, it is often important to securely bind a public key with 
the legitimate user's ID. Such a binding can be achieved using public key certificates, 
which are digitally signed and issued by a certification authority. 
Roughly speaking, a one-way hash function h() has the properties that: 

1) for any message m, the hash h(m) can be easily computed; 

2) given h(m), finding m is computationally infeasible; and 

3) finding two messages that have the same hash is computationally 
infeasible. 

For more information on cryptosystems, digital signature schemes, public key 
certificates, and one-way hash functions, reference is made to A. Menezes, P. 
Oorschot, and S. Vanstone, Handbook of Applied Cryptography . CRC Press, pp. 425- 
488, pp. 559-561, and pp. 321-383 1996; and C. Kaufinan, R. Perlman, and M. 
Speciner, Network Security — Private Communication in A Public World. PTR 
Prentice Hall, Englewood Cliffs, NJ, pp. 152-158, pp. 177-204 and pp. 101-129 1995. 

The UNIX (a trademark of Bell Laboratories) operating system provides a classical 
example of a password based authentication system. In UNIX, each user is provided 
with a unique login user name and is allowed to choose a secret password. The UNIX 
system maintains a password file containing the user name and a hash of the user's 
password computed using a one-way hash function with the user's password as input. 
When a UNIX user desires to access the UNIX system, the user keys in his or her user 
name and password to a terminal. The terminal computes the hash of the password 
and sends the hash along with the user name to the UNIX system. Because only the 
user knows the password, if the hash and user name match those in the password file, 
the user is considered authenticated. 

The UNIX password system is simple to implement, but has a number of problems. 
Firstly, it is vulnerable to a "replay" attack. That is an eavesdropper can intercept the 
user name and the hash of the password, and replay them to the UNIX system. 
Secondly, knowing the hash of a password, an eavesdropper can mount an off-line 
dictionary attack. The person can guess a password, compute its hash, and see if the 
two hash values match. The person can then systematically try passwords, one at a 
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time, until a match is found. Since people tend to choose easy to remember or "weak 
passwords", such an attack can be very effective. Thirdly, the UNIX system only 
authenticates the user, and no secret key is established to encrypt subsequent 
interactions between the user and the system. 

A number of authentication and key establishment protocols have been proposed to 
improve upon the UNIX password protocol. Examples include: 

1) R. Needham and M. Schroeder, "Using encryption for 
authentication in large networks of computers", Communications of 
the ACM, Vol. 21, December 1978, pp. 993-999; 

2) D. Otway and O. Rees, "Efficient and timely authentication", 
Operating Systems Review, Vol. 21, No. 1, January 1987, pp. 8-10; 

3) L. Gong, M. Lomas, R. Needham, and J. Saltzer, "Protecting poorly 
chosen secrets from guessing attacks IEEE Journal on Selected 
Areas of Communications, Vol. 1 1, No. 5, June 1993, pp. 648-656; and 

4) US Patent No. 5,440,635 issued to S. Bellovin and M. Merritt on 
August 8, 1995. 

A number of the conventional authentication protocols require that the parties share 
secret information (such as a password) or possess each other's public keys in 
advance. There are many potential difficulties for a human user to share secrets with a 
large number of remote parties. Firstly, it requires a secure secret distribution 
mechanism to be in place. Secondly and more importantly, human users are not good 
at remembering secrets of good quality, since such secrets look like random data. 
Knowing each other's public key in authenticated manners is also problematic in a 
distributed and open environment. 

Without good authentication and encryption, voice-over-IP (the Internet protocol) can 
be eavesdropped without much difficulty. Pretty Good Privacy Phone or PGPfone 
(both are trademarks of Pretty Good Privacy Inc.) implements an authentication 
protocol based on exchange of voice signals and Diffie-Hellman key exchange 
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protocol, P. Zimmermann, PGPfone Owner's Manual, Version J, 0 beta J, 5 January 
1996, http://web.mit.edu/network/pgpfone/manual . 

Before proceeding with a discussion of the PGPfone authentication protocol, the 
Diffie-Hellman key exchange protocol, W. Diffie and M. Hellman, "New directions in 
cryptography", IEEE Transactions on Information Theory, Vol. IT-22, No. 6, pp. 
644-654, November 1976 is reviewed. Diffie-Hellman key exchange allows two 
parties, without sharing keying material in advance, to agree to a secret key over an 
open channel, but without authentication. In Diffie-Hellman key exchange, two 
parties A and B agree on an appropriate prime p and a generator of Z* p , where Z* p = 
{x | 0< x < p-1 } . Party A generates a random number x, l<x<p-l, and then 
computes and sends to Bob g* modulo p. Party B generates a random number y, 1< y 
< p -1, and then computes and sends to party A g y modulo p. 

Party A computes a shared key k = (g*)* modulo p, and party B computes k = (g x ) y 
modulo p. The Diffie-Hellman protocol can be carried out in any group in which the 
discrete logarithm problem is difficult to solve. This protocol, however, is vulnerable 
to "man-in-the-middle" attacks. If a party C comes in the middle between parties A 
and B, when party A wishes to have a Diffie-Hellman exchange with party B, party C 
intercepts all the messages from A and B and enters the Diffie-Hellman exchange with 
A and B, respectively. As a result, C agrees a secret key with A and another secret key 
with B so that C can decrypt all the messages from A using the key shared with A and 
re-encrypt the messages using the key shared with B. 

The PGPfone authentication protocol assumes that the two parties are familiar with 
each other's voice. The two parties first establish a shared value (e.g., g* y mod p) by 
performing a Diffie-Hellman exchange. The parties next compute the hash of the 
shared value. Each party then reads the first few bytes (in hexadecimal format or in 
English words. PGPfone; maintains a list that maps the 256 values of a byte to 256 
English words) of the hash to each other. If the bytes at the two ends match and if the 
voice sounds like that of the claimed party, the parties are considered authentic. 
However, if an attacker is able to collect sound samples of all the 256 words by, for 
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example, eavesdropping on someone's phone calls, the attacker is able to impersonate 
the victim at will. 

Thus, a need clearly exists for a method of remote authentication based on exchanging 
signals representing biometrics information and establishing a cryptographic key. 

SUMMARY 

In accordance with one aspect of the invention, a method of authenticating a 
remote party and establishing a cryptographic key for secure communications via an 
insecure communications channel. The method includes the steps of: 

generating a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

generating a random number x, computing g* modulo p, where g and p are 
numbers, deriving a key k A from g* modulo p, encrypting the first challenge signal 
with k A and a symmetric key cryptosystem, and sending a first ciphertext to the 
remote party; 

receiving a second ciphertext from the remote party, sending g* modulo p to 
the remote party, and starting a clock; 

receiving a third ciphertext and g y modulo p from the remote party, stopping 
the clock, and computing an elapsed time interval of the clock; 

deriving a key k B from g y modulo p, computing g xy modulo p, deriving a key 
k^ from g xy modulo p, decrypting the second ciphertext with k B to recover a second 
challenge signal from the remote party, decrypting the third ciphertext to recover a 
first response signal from the remote party; 

verifying that the elapsed time of the clock is within a predetermined interval 
(TL A , TUJ, where TL A and TU A are positive numbers; 

verifying that the second challenge signal is produced by the remote party; 

producing a second response signal of minimum duration T, encrypting the 
second response signal with k^ and sending a fourth ciphertext to the remote party; 

verifying that the first response signal is a response produced by the remote 
party to the first challenge signal; and 



generating a key k from g* y modulo p for secure communications with the 
remote party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

In accordance with another aspect of the invention, there is disclosed a method 
of authenticating a remote party and establishing a cryptographic key for secure 
communications via an insecure communications channel. The method includes the 
steps of: 

receiving a first ciphertext from the remote party, generating a random number 
y, computing g y modulo p, where g and p are numbers; 

producing a first challenge signal of a minimum duration T, where T is a fixed 
time interval; 

deriving a key k B from g y modulo p, encrypting the first challenge signal with 
k B and a symmetric key cryptosystem, and sending a second ciphertext to the remote 
party; 

receiving g x modulo p from the remote party, deriving a key k A from g* 
modulo p, decrypting the first ciphertext to recover a second challenge signal from the 
remote party; 

verifying that the second challenge signal is produced by the remote party, 
producing a first response signal of the minimum duration T; 

computing g* y modulo p, deriving a key k^ from g* y modulo p, encrypting the 
first response signal, sending a third ciphertext and g y modulo p to the remote party, 
and starting a clock; 

receiving a fourth ciphertext, stopping the clock, computing the elapsed time 
of the clock, and decrypting the fourth ciphertext to recover a second response signal 
from the remote party; 

verifying that the elapsed time of the clock is within a predetermined interval 
(TLb, TU b ), where TL^ and TU B are positive numbers; 

verifying that the second response signal is a response produced by the remote 
party to the first challenge signal; and 



-7- 

generating a key k from g* y modulo p for secure communications with the 
remote party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

In accordance with yet another aspect of the invention, there is disclosed a 
method of authenticating a remote party and establishing a cryptographic key for 
secure communications via an insecure communications channel. The method 
includes the steps of: 

generating a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

generating a random number x, computing g* modulo p, where g and p are 
numbers, deriving a key k A from g* modulo p, encrypting the first challenge signal 
with k A and a symmetric key cryptosystem, and sending a first ciphertext to the 
remote party; 

receiving a second ciphertext, sending g* modulo p to the remote party, and 
starting a clock; 

receiving g y modulo p, computing a key k B from g y modulo p, decrypting the 
second ciphertext to recover a second challenge signal from the remote party; 

verifying the second challenge statement to ensure that the second challenge 
statement is produced by the remote party, and producing a first response signal of 
minimum duration T; 

computing g xy modulo p, deriving a key k^ from g xy modulo p, encrypting the 
first response signal and sending a third ciphertext to the remote party; 

receiving a fourth ciphertext from the remote party, stopping the clock, 
decrypting the fourth ciphertext with k^ to recover a second response signal from the 
remote party; 

verifying that the elapsed time of the clock is within a predetermined interval 
(tl A , tuj, where tl A and tu A are positive numbers; 

verifying that the second response signal is a response produced by the remote 
party to the first challenge signal; and 
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generating a key k from g xy modulo p for secure communications with the remote 
party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

In accordance with a further aspect of the invention, there is disclosed a 
method of authenticating a remote party and establishing a cryptographic key for 
secure communications via an insecure communications channel. The method 
includes the steps of: 

receiving a first ciphertext from remote party, generating a random number y, 
and computing g y modulo p, where g and p are numbers; 

producing a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

deriving a key k B from g y modulo p, encrypting the first challenge signal with 
k B and a symmetric key cryptosystem, and sending a second ciphertext; 

receiving g* modulo p, computing a key k A from g* modulo p, decrypting the 
first ciphertext to recover a second challenge signal from remote party, sending g y to 
remote party and starting a clock; 

verifying the second challenge statement to make sure that the second 
challenge statement is produced by the remote party, and then producing a first 
response signal of minimum duration T; 

computing g xy modulo p, deriving a key k^ from g xy modulo p, encrypting the 
first response signal and sending a third ciphertext to the remote party; 

receiving a fourth ciphertext from the remote party, stopping the clock, 
decrypting the fourth ciphertext with k^ to recover a second response signal from the 
remote party; 

verifying that the elapsed time of the clock is within an interval (tl B , tu B ), 
where tl B and tu B are positive numbers; 

verifying that the second response signal a response produced by the remote 
party to the first challenge signal; and 
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generating a key k from g xy modulo p for secure communications with the 
remote party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

A small number of embodiments of the invention are described hereinafter with 
reference to the drawings, in which: 

Fig. 1 is a block diagram illustrating a communication system model between two 
remote individuals over an open transmission channel; 

Fig. 2 is a flowchart depicting the operation of a first embodiment of the invention; 
Fig. 3 is a flowchart showing the operation of a second embodiment of the invention; 
Fig. 4 is a flowchart depicting the first scenario of a man-in-the-middle attack in the 
first embodiment of Fig. 2; 

Fig. 5 is a flowchart showing the second scenario of a man-in-the-middle attack in the 
first embodiment of the invention of Fig. 2; 

Fig. 6 is a block diagram illustrating a communication device in accordance with the 
first embodiment of the invention; and 

Fig. 7 is a block diagram of a general-purpose computer with which the embodiments 
of the invention can be practised. 

DETAILED DESCRIPTION 

A method, an apparatus, and a computer program product for remote authentication 
based on exchanging signals representing biometrics information and establishing a 
cryptographic key are described. In the following description, numerous details are 
set forth including communications channels for example. It will be apparent to one 
skilled in the art, however, that the present invention may be practised without these 
specific details. In other instances, well-known features are not described in detail so 
as not to obscure the present invention. 
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The detailed description is organised as follows: 



1. 



Notation and Definitions 



2. 



Block Diagram of Communications Device 



3. 



First Embodiment 



4. 



Second Embodiment 



6. 



5. 



Security Considerations 
Computer Implementation 



In the following description, components of the system are described as modules. A 
module, and in particular its functionality, can be implemented in either hardware or 
software. In the software sense, a module is a process, program, or portion thereof, 
that usually performs a particular function or related functions. In the hardware sense, 
a module is a functional hardware unit designed for use with other components or 
modules. For example, a module may be implemented using discrete electronic 
components, or it can form a portion of an entire electronic circuit such as an 
Application Specific Integrated Circuit (ASIC). Numerous other possibilities exist. 
Those skilled in the art will appreciate that the system can also be implemented as a 
combination of hardware and software modules. 

1 . Notation and Definitions 

The following notation is used throughout: 



d(k, c): 



A, B : 



e(k, m) : 



Parties (Alice and Bob, respectively) that seek to have 
secure communications; 

Encrypted message containing original message m and 
a key k using a symmetric key cryptosystem; 
Decrypted message containing a ciphertext c and the 
key k using a symmetric key cryptosystem; 
An acoustic wave or digital representation of a 
challenge statement spoken by a party X (either A or 
B); whether C x is an acoustic wave or a digital 
representation should be clear to those skilled in the art 
from the discussion context; 
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R Y : 



An acoustic wave or digital representation of a response 
statement spoken by a party Y in reply to C x ; whether 
R Y is an acoustic wave or a digital representation should 
be clear to those skilled in the art from the discussion 



context; 



ICxI: 
|R Y |: 



the time duration of C x ; 
the time duration of R Y ; and 

A required minimum time duration of any statement 



T : 



spoken by a party. 

In the following description, "secure communications" means communications that 
are authenticated or confidential. 

Fig. 1 illustrates a general model 100 of communications between two remote 
individuals. In this model 100, Alice 110 and Bob 150 are two individuals, who are 
familiar with each other's biometrics characteristics (without loss of generality, voice 
signals are used hereinafter) and wish to have a secure communication. The 
transmission channel 130 represents the means, and more specifically the media, 
through which communication messages are exchanged between the communication 
devices 120, 140. The transmission channel 130 includes, but is not limited to, any 
communications means or media such as computer networks, public telephone 
switching networks, and radio links. Alice 110 and Bob 150 communicate with each 
other by interfacing through communication devices (A) 120 and (B) 140, 
respectively. Devices (A, B) 120, 140 have appropriate speech signal processing 
capabilities (such as speech encoding and decoding). The devices 120, 140 accept 
audio input, either directly or indirectly, from Alice 110 and Bob 140, respectively, 
and output the other party's received audio signal. 

In the embodiments of the invention, devices (A, B) 120, 140 each exchange signals 
using a Diffie-Hellman key exchange system and a symmetric key cryptosystem. For 
the purposes of illustration only, both devices (A, B) 120, 140 are assumed to have 
access to a common set of Diffie-Hellman parameters, g and p, which may be 
distributed either off-line or on-line. 
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The minimum duration of any statement spoken by a party 110, 150 is required to be 
T, where T is either pre-fixed or agreed upon by Alice 110 and Bob 1 50. For security 
reasons, T should be much longer than the channel round-trip delay and the 
processing delay of each device 120, 140. For example, T may be in the range of tens 
of seconds to several minutes, while channel round- trip delay and processing delays 
are normally less than one second. To keep notation compact, only residues modulo 
are used hereinafter that is, g* modulo p, g y modulo p, and g* y modulo p are written 
simply as g\ g y , and g xy , respectively. In addition, while not mentioned explicitly, 
devices (A and B) 110, 150 are assumed to perform audio encoding/decoding and 
audio compression/decompression operations, whenever necessary. 

In the embodiments of the invention, the communicating parties are assumed to be 
familiar with each other's voice (biometrics characteristics in general) and able to 
recognise each other by listening to each other's speech. This is a reasonable 
requirement, since there are generally no confidential topics between two strangers 
without the involvement of a trusted third party. In the embodiments, party 110 
(Alice) starts a session with party 150 (Bob) by speaking a challenge statement such 
as: 

"This is Alice! The time is 21 minutes passed 9AM. How was your 
mid-term examination, Bob?". 

Upon hearing Alice's message, party 150 (Bob) is expected to speak a response 
statement like: 

"Hi, Alice! Bob's here. My mid-term exam was not very good. But 
thank God, it is over!". 

In the embodiments, Alice is assumed to be able to distinguish whether the stated 
response is in Bob's voice and whether the stated response corresponds to her 
challenge statement in the first place. Furthermore, it is assumed that it is difficult for 
an attacker to mimic a target party's voice to produce a meaningful response statement 
in real-time. 
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2. Block Diagram of Communications Device 

Fig. 6 is a block diagram of a communication device 600 in accordance with the 
embodiments of the invention. The communications device 600 includes a speech 
encoding/decoding module 632, a control module 636, an encryption/decryption 
module 640, a key generator 650, a Diffie Hellman key exchange system 660, a timer 
module 670, an input/output (I/O) module 680 for transmitting and receiving data via 
the communications channel 610, and a memory 682. Optionally, the device 600 also 
includes a transducer module 630 and an audio compression/decompression module 
634. 

The control module 636 controls the components of the device 600 via signals 690A - 
6901. The transducer module 630 converts audio 620 into audio signals and vice 
versa. Operation of the transducer module 630 can be controlled by the control 
module 636 via a control signal(s) 690A. In turn, the transducer 630 is coupled 
bidirectionally with the speech encoding/decoding module 632. This module 632 
speech encodes and decodes data and is controlled by the control module 636 via the 
signal(s) 690B. Optionally, the speech encoding/decoding module 632 is coupled 
bidirectionally to an audio compression/decompression module 634. Otherwise, the 
module 632 can be directly coupled to an encryption/decryption module 640. The 
control module 636 is coupled to the audio compression/decompression module 634 
via the control signal 690C. 

The encryption/decryption module 640 is optionally coupled to the audio 
compression/decompression module 634 and is coupled to the control module 636 via 
a control signal(s) 690D. The encryption/decryption module 640 is also coupled to 
memory 682, Diffie Hellman key exchange system 660, timer module 670, and the 
I/O module 680 by bus 692. The key generator 650 is coupled between the Diffie 

Hellman key exchange system 660 and the encryption/decryption module 640. The 

key generator 650 produces a key k from a code G z and is coupled to the control 
module 636 via a control signal(s) 690E. The memory 682 is coupled to the control 
module 636 via a control signal(s) 690G. 
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The Diffie Hellman key exchange system 660 has as inputs code G, prime number P, 
random value X, and a control signal(s) 690F from control module 636. The timer 
module 670 has an interrupt (INT) output to the I/O module 680 that allows the timer 
module 670 to interrupt transmission/reception of data via the I/O module 680. The 
timer module 670 is coupled to the control module 636 via a control signal 6901. The 
I/O module 680 is coupled to the control module 636 by control signal(s) 690H. 
Operation of the communication device 600 is described hereinafter with reference to 
the first and second embodiments. 

The embodiments of the invention advantageously employ mechanisms that enable 
users to authenticate each other and have secure communications over open or 
insecure channels by using a different technique. More particularly, users are not 
required to share or remember any secret key or password, or possess each other's 
public keys in advance. Authentication and key establishment are achieved by 
exchanging signals representing a remote party's biometrics information. In this 
connection, the embodiments of the invention utilize a cryptographic one-way hash 
function. The embodiments concentrate on authenticating remote human users based 
on the interaction of signals representing a remote party's biometrics information 
(such as acoustics waves). Parties are assumed to be able to identify each other based 
on the exchanged biometrics signals. There is no need for the parties to share any 
secret key, or know each other's public key in advance. The embodiments of the 
invention can be advantageously employed in applications such as Internet 
telephoning or voice-over-IP (Internet Protocol). 

3. First Embodiment 

Fig. 2 is a flowchart illustrating the method of secure communications according to 
the first embodiment of the invention. The flowchart is organised in columns in the 
following order: Alice 1 1 0, dev ice (A) 120, device (B) 140, and Bob 150. In step 
210, Alice 110 speaks a challenge statement C A , which is input to Device A (shown as 
device 600 in Fig. 6). Preferably, the challenge statement C A contains some 
"freshness" elements, such as the date and time, and news headlines of the day. In 
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step 212, Device A generates a random number x and computes g\ Device A then 
preferably computes a key k A from code g* using the key generator. Next, Device A 
encrypts the challenge statement C A with key k A using the key generator and sends the 
ciphertext, referred to as Message Al : 
e(k A , CJ, 

preferably together with Alice's identity to Device B over the transmission channel 
610. 

In step 214, Device B receives the Message Al and prompts Bob to speak a challenge 
statement C B in step 216. In step 218, Device B generates a random number y, 
computes g y and preferably a key k B from the code g y for a symmetric key 
cryptosystem. Device B then encrypts C B and transmits the ciphertext, referred to as 
Message A2: 

e (k B , C B ), 

preferably together with Bob's identity to Device A. 

In step 220, Device A receives the Message A2. In step 222, Device A sends to 
Device B the code, referred to as Message A3: 

and starts a clock or timer to measure the time interval for a response. In step 224, 
after receiving code g*, Device B computes the key k A from the code g* and decrypts 
the ciphertext received in Message Al to recover the challenge statement C A . Device 
B then plays back the challenge statement C A to Bob. In step 226, Bob listens to the 
challenge statement C A and verifies that the voice belongs to Alice. If the verification 
fails, Bob terminates the session. Otherwise, in step 228, Bob speaks a response 
statement R B in reply to the challenge statement C A (e.g., by iterating C A in his own 
voice). 

In step 230, Device B computes the code (g x ) y = g xy and a key k^ from g* y in well- 
known fashion for the symmetric key cryptosystem, and encrypts R B with k^ and the 
symmetric key cryptosystem to obtain e^^, R B ). In step 232, Device B sends, as 
Message A4, the following: 



g y , eCk^, R B ), 
to Device A and then starts its clock. 
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In step 234, upon receipt of Message A4, Device A first stops its clock started in step 
222. Time t A is the elapsed time of the clock. Device A then computes the code (g*)* 
= g^, the key k^ from code g^, and the key k B from the code g y . Device A then 
decrypts e(k B , C B ) with the key k B and e(k AB> R B ) with key k^ to recover the challenge 
statement C B and the response R B , respectively. |C A | is the duration of the challenge 
statement C A , and |Rb| is the duration of the response R B . It will be readily apparent to 
those skilled in the art how to obtain the duration of an audio signal. Let TL A = |C A | + 
|R B |. Further, in step 230, Device A checks the elapsed time t A to see if: 

t A e (TL A , TUJ, (1) 
where TU A can be taken advantageously as T + TL A if the channel round trip delay 
and processing delay in Devices A and B 120, 140 are negligible compared with T. 
Such delays can be easily incorporated into Equation (1) by those skilled in the art. If 
t A g (TL A , TUa) is not true, Device A terminates the session. Otherwise, in step 236, 
Alice listens and verifies the challenge statement C B . 

If Alice recognises that the challenge statement C B is not Bob's voice, Alice 
terminates the session. Otherwise, in step 238, Alice speaks a response statement R A 
(e.g., by iterating C B in her own voice) in reply to the challenge statement C B . In step 
240, Device A encrypts the response statement R A with the key k^ and sends to 
Device B the ciphertext, as Message A5: 
e (kAB> Ra)- 

In step 248, following step 238, Alice listens and verifies the response statement R B . 
If the response statement R B is not either a response to the challenge statement C A or 
in Bob's voice, Alice stops the session. If Alice is sure that the response statement R B 
is a reply to the challenge statement C A in Bob's voice, she begins communicating in 
step 250. 

Next, from step 240 in step 242, upon receipt of Message A5, Device B stops its clock 
or timer. The time t B is the elapsed time of the clock. Also, Device B decrypts eCk^, 
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Ra) with the key to obtain the response statement R A . Let |C B | denotes the 
playback duration of C B and |R A | the playback duration of R A . Let TLq = |C B | + |R A |. 
In step 242, Device B checks the elapsed time to see if: 

t B e (TLb, TU b ), (2) 
where TU B can be taken advantageously as T + TLg if the channel round trip delay 
and processing delay at Devices A and B are negligible in comparison with T. If 
Equation (2) is not satisfied, Device B terminates the session. Otherwise, Device B 
plays back the response statement R A to Bob. In step 244, Bob listens and verifies the 
response statement R A . If Bob recognises that R A is a reply to the challenge statement 
C B in Alice's voice, Bob can be confident that he is communicating with Alice and he 
can proceed to step 246. Otherwise, Bob stops the session. 

In steps 250 and 246, respectively, Bob and Alice preferably communicate with each 
other. In step 252, Device A encrypts messages from Alice and decrypts messages 
from Bob with a key derived from g xy , preferably using another symmetric key 
cryptosystem. Similarly, in step 256, Device B likewise encrypts messages from Bob 
and decrypts message from Alice with a key derived from g xy in the same way as 
Device A and with the same symmetric key cryptosystem as used by Device A. 

4. Second Embodiment 

Fig. 3 is a flowchart illustrating a method of secure communications according to the 
second embodiment of the invention. In step 310, Alice starts the session by speaking 
d challenge statement C A . In step 312, Device A generates a random value x, 
computes a code g* and a key k A from g* for a symmetric key cryptosystem, encrypts 
the challenge statement C A with k A and sends to B the ciphertext, as Message B 1 : 
e(k AJ C A ). 

In step 314, Device B receives Message Bl. Next, in step 316, Device B prompts Bob 
to speak a challenge statement C B „in„step 318. In step 316, Device B then generates a 
random number y, computes code g y and a key k B from g y for a symmetric key 
cryptosystem, encrypts C B with k B and sends to Alice the ciphertext, as message B2: 
e (k B > C B ). 
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In step 320, Device A receives Message B2. Next, in step 322, Device A sends to 
Bob, as Message B3: 

and starts a clock. 



In step 324, upon receipt of the Message B3, Device B computes a key k A from code 
g\ decrypts e(k A , to recover C A , sends to Alice as Message B4: 
g y , 

and starts a clock. Device B then outputs the challenge statement C A to Bob. In step 
326, Bob listens and verifies whether the challenge statement C A is in Alice's voice. 
Bob terminates the process if he has doubts on the originality of the challenge 
statement C A . When the challenge statement C A is verified successfully by Bob, in 
step 328, Bob speaks a response statement Kq in reply to the challenge statement C A . 
In step 330, Device B computes code g** and a key k^ from g xy , encrypts R B and 
sends to Alice the ciphertext, as Message B5: 
^k^, R B ). 

On the other hand, Message B4 is received by Device A in step 332. Device A 
computes k B from g y , decrypts e(k B , QJ to recover C B . Alice listens and verifies C B in 
step 334. Alice stops the process if she believes that C B is not in Bob's voice; 
otherwise, she speaks a response statement R A in reply to C B in step 336. In step 338, 
Device A computes g** and k^ from g 5 *, encrypts R A and sends to Bob the ciphertext, 
as Message B6: 
eCk^, R A ). 



In step 340, Device A receives Message B5. Device A also stops the clock, decrypts 
e O c AB> Rb) to recover the response statement R B and checks to see if the elapsed time 
of the clock T A satisfies the following: 

T A € (tl A , tuj, (3) 
where preferably tl A = |C A | + |R^| and tu A = tl A + T. Device A terminates the session if 
Equation (3) is not satisfied. Otherwise, Device A outputs the response statement R B 
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to Alice. In step 342, Alice listens and verifies the response statement R B . Alice stops 
the session if she is not convinced that the response statement R B is Bob's response to 
the challenge statement C A . Otherwise, in step 344, Alice starts communications with 
Bob. 

In step 348, upon receipt of the Message B6, Device B stops its clock, decrypts eCk^, 
R A ) to recover R A . Device B, then checks to see if the elapsed time of its clock 
satisfies the following: 

T B e (tl B , tu B ) (4) 
where preferably tl B = |C B | + |R A | and tu B = tl B + T. Device B terminates the session if 
Equation (4) is not satisfied. Otherwise, Device B outputs the response statement R A 
to Bob. In step 350, Bob listens and verifies R A . Bob stops the session if he is not 
convinced that the response statement R A is Alice's response to C B . Otherwise, in step 
352, he starts communications with Alice. 

With successful authentication of both Alice and Bob, Devices A and B, derive a key 
from g* y and use the key to encrypt and decrypt messages between Alice and Bob in 
steps 346 and 354, respectively, using communications obtained in steps 344 and 352. 

5. Security Considerations 

Symmetric key cryptosystems are used in the embodiments to encrypt challenge 
signals and response signals. It is important that all encryptions resist data 
modification (such as cut and paste) attacks. 

The symmetric key cryptosystem used to encrypt challenge and response signals can 
be replaced by a cryptographic commitment function. Such a commitment function 
has the following properties: 

1) no one can modify the contents of the commitment without being 
detected; and 

2) no one can get any information about its contents unless the 
committing party discloses the information. 
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One way to form a commitment function is using a cryptographic one-way hash 
function h(). To commit to an item I, the committing party computes the commitment 
h(k || I), where k is a secret key and k || I is the concatenation of k and I. To verify the 
commitment, the verifying party must have k and I, compute h (k || I) and compare h 
(k || I) with the commitment. 

Checking the lengths of the elapsed time intervals of the clocks in both illustrative 
embodiments is very important in detecting any man-in-the-middle attacks. This is 
illustrated using the two attacking scenarios respectively, in relation to the first 
embodiment of Fig. 2. A similar analysis can be done for the second embodiment. In 
the following descriptions, the term "Alice" is used to refer to both the user Alice and 
Device A. Similarly, the term "Bob" is used to denote both the user Bob and Device 
B. 

Fig. 4 depicts a scenario where Alice attempts to set up a communications session 
with Bob and where Clark performs a man-in-the-middle attempt to impersonate Bob 
to Alice. The flow diagram is accordingly organised into three columns: Alice, Clark 
and Bob. In step 410, Alice starts a session by generating a random number x, 
computing g\ speaking a challenge statement C A , computing k A from g*, encrypting 
C A with k A , and sending the ciphertext e(k A , CjJ to Bob. 

In step 412, the ciphertext is intercepted by Clark. Clark generates a number z, 
computes g 2 and a key from g 2 , encrypts an old challenge statement C' B from Bob, 
and sends the ciphertext e(kc, C' B ) to Alice. In step 414, Alice receives the ciphertext, 
replies with g* and starts a clock. The code g* is again intercepted by Clark at step 
416. In step 416, Clark derives the key k A from g* and decrypts e(k A , CjJ to recover 
C A . Clark cannot mimic Bob's voice to produce a meaningful response statement 
so Clark impersonates Alice and starts a new session with Bob by sending Bob e(kc, 

cj. 



In step 418, upon receipt of e(kc, QJ, Bob generates a random value y, and computes 
code g y and a key k B . Bob then speaks a challenge statement C B , encrypts the 
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challenge statement using key k B , and sends the ciphertext e(k B , C B ) to Alice. Clark 
intercepts the ciphertext in step 420. To continue impersonating Alice, Clark sends g 2 
to Bob. In step 422, Bob computes key kc from g 2 , decrypts e^, QJ, and listens C A , 
which was indeed spoken by Alice. Bob speaks a response statement R B , computes a 
key k BC from g 2 *, encrypts R B with k BC , and transmits g y and the ciphertext e(k BC , R B ). 
In step 424, Clark again intercepts the ciphertext, computes the key k BC from g^, and 
decrypts e(k BC , R B ). Now Clark gets R B . Clark then computes k AC from g* 2 , encrypts 
R B with k AC , and sends the ciphertext e(k AC , R^) to Alice. 

In step 426, Alice stops the clock, decrypts e(kc> C' B ) and e(k AC , R B ) to recover C' B 
and R B , respectively. Alice also listens and verifies that C' B is in Bob's voice. Alice 
then speaks and encrypts a response statement R A , and sends the ciphertext e(k AC , R^. 

Next, in step 428, Alice listens and verifies the response statement R B . Since R B is 
indeed a response to C A from Bob, Alice is fooled into believing Clark is Bob. 
However, the embodiments of the invention prevent this from happening by checking 
the clock's elapsed time t A against the interval (TL A , TU A ) per Equation (1), where 
TL A = |C A | + |R B |, and TU A = TL A + T. 

To appreciate the rationale behind Equation (1), without the man-in-the-middle attack 
by Clark, t A = |C A | + |Rb| + A A „ where A A1 is the delay due to processing, transmission, 
and a pause interval introduced by Bob after listening to the challenge statement C A , 
but before speaking the response statement R B . However, with the man-in-the-middle 
attack shown in Fig. 4, t A = |C A | + |R B | + |C B | + A^, where A^ is a delay similar to A A1 . 
Since it is required that T » A AI and A^ and that |C B | < T. It can be readily 
appreciated that t A = |C A | + |R B | + |C B | + A^ > TL A + T = TU A with the attack and that 
*a = |C A | + |R B | + A A1 w TL A < TU A without the attack. Therefore, by checking t A 
against Equation (1), the man-in-the-middle attacked can be detected. 



Fig. 5 shows a second scenario 500 of a man-in-the-middle attack, where Clark 
impersonates Alice to Bob. Again the flow diagram is organised in columns: Alice, 
Clark, and Bob. In step 510, Clark generates z, computes g 2 and a key from g 2 , and 
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encrypts C' A - an old statement from Alice. Clark starts the impersonation by sending 
the ciphertext e(kc, C'a) to Bob. 

In step 512, upon receipt of the message from Clark, Bob generates y, g y , and a key k B 
from g y . Bob then speaks a challenge statement C B , encrypts the challenge statement 
with k B , and transmits the ciphertext e(k B , C B ) to Alice. In step 514, the ciphertext is 
intercepted by Clark and Clark sends g 2 to Bob. In step 516, Bob derives the key 
from g 2 , and decrypts e(kc, C'a) with kc to recover C' A . Bob listens to the challenge 
statement C' A and believes that C' A was indeed spoken by Alice. Bob then speaks a 
response statement R^, derives a key k BC from g^, encrypts R B , transmits the 
ciphertext e(k BC , R B ) and g y and starts a clock. 

Next, in step 518, upon interception of the ciphertext e(k BC , R B ) and g y , Clark derives 
k B from g y and k BC from g^, and decrypts e(k B , C B ) to recover C B . Since Clark is not 
able to reply to the challenge statement C B in Alice's voice, Clark encrypts the 
challenge C B with k^ and starts a session with Alice by sending e^, C B ) to Alice. In 
step 520, upon receipt of e(kc, Cb), Alice generates x, g\ and a key k A from g\ Alice 
then speaks a challenge statement C A , encrypts the challenge statement with k A , and 
sends the ciphertext e(k A , QJ. In step 522, Clark intercepts the ciphertext and sends 
g 2 to Alice. In step 524, Alice derives a key k AC from g 2 *, and decrypts e^, C B ) to 
recover C B . Alice then listens to the challenge statement C B and believes that she 
hears Bob's voice. Alice then speaks a response statement R A , encrypts the response 
statement with k AC and sends g* and the ciphertext e(k AC , R A ). In step 526, Clark 
intercepts the message from Alice and decrypts the ciphertext to obtain the response 
R A . Clark then encrypts the response statement with k BC and sends the ciphertext 
e(k BC , RJ to Bob. 

In step 528, Bob receives e(k BC , Ra), stops the clock, and decrypts the ciphertext to 
recover R A . Without checking the elapsed time of the clock, t B , Bob can be misled into 
believing that Bob is communication with Alice since R A is Alice's reply to the 
challenge statement C B . This attack can be foiled easily by checking t B against the 
interval (TLb, TU b ) (see Equation 2), where TLq = |C B | + |R A | and TU B = TLb + T. 
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Without the man-in-the-middle attack by Clark, t B = |C B | + |R A | + A B1 , where A B1 is the 
delay due to processing, transmission, and a pause interval introduced by Alice after 
listening to C B but before speaking the response statement R A . With the man-in-the- 
middle attack of Fig. 5, t B = |Cb| + R A | + |C A | + A B2 , where A B2 is a delay similar to A B1 , 
It is required that T » A B1 and A B2 and that |C A | < T. Then it can be seen that t B = |C B | 
+ R A | + |C A | + A B2 < TLb + T = TU B with the attack and t B = |C B | + R A | + A B1 « TLb < 
TU B without the attack. Therefore, by checking t B against Equation (2), the man-in- 
the-middle attack can be detected. 

6, Computer Implementation 

The embodiments of the invention are preferably implemented using a computer(s), 
such as the general-purpose computer shown in Fig. 7. In particular, the processes of 
Figs. 2 and 3 can be implemented as software, or a computer program, executing on 
the computer, where each communication device 120, 140, 600 of Figs. 1 and 6 can 
be implemented using a general purpose computer. The method or process steps for 
remote authentication based on exchanging signals representing biometrics 
information are effected by instructions in the software that are carried out by the 
computer. The software may be implemented as one or more modules for 
implementing the process steps. A module is a part of a computer program that 
usually performs a particular function or related functions. Also, as described 
hereinbefore, a module can also be a packaged functional hardware unit for use with 
other components or modules. 

In particular, the software may be stored in a computer readable medium, including 
the storage devices described below. The software is preferably loaded into the 
computer from the computer readable medium and then carried out by the computer. 
A computer program product includes a computer readable medium having such 
software or a computer program recorded on it that can be carried out by a computer. 
The use of the computer program product in the computer preferably effects an 
advantageous apparatus for remote authentication based on exchanging signals 
representing biometrics information in accordance with the embodiments of the 
invention. 
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The computer system 700 consists of the computer 702, a video display 716, and 
input devices 718, 720. In addition, the computer system 700 can have any of a 
number of other output devices including line printers, laser printers, plotters, and 
other reproduction devices connected to the computer 702. The computer system 700 
can be connected to one or more other computers via a communication interface 708b 
using an appropriate communication channel 730 such as a modem communications 
path, a computer network, or the like. The computer network may include a local area 
network (LAN), a wide area network (WAN), an Intranet, and/or the Internet. 

The computer 702 itself consists of a central processing unit(s) (simply referred to as a 
processor hereinafter) 704, a memory 706 which may include random access memory 
(RAM) and read-only memory (ROM), input/output (IO) interfaces 708 A, & 708B a 
video interface 710, and one or more storage devices generally represented by a block 
712 in Fig. 8. The storage device(s) 712 can consist of one or more of the following: 
a floppy disc, a hard disc drive, a magneto-optical disc drive, CD-ROM, magnetic 
tape or any other of a number of non-volatile storage devices well known to those 
skilled in the art. Each of the components 704 to 712 is typically connected to one or 
more of the other devices via a bus 714 that in turn can consist of data, address, and 
control buses. 

The video interface 710 is connected to the video display 716 and provides video 
signals from the computer 702 for display on the video display 716. User input to 
operate the computer 702 can be provided by one or more input devices 708B. For 
example, an operator can use the keyboard 718 and/or a pointing device such as the 
mouse 720 to provide input to the computer 702. 

The system 700 is simply provided for illustrative purposes and other configurations 
canbe employed without departing from the scope and spirit of the invention. 
Computers with which the embodiment can be practiced include IBM-PC/ATs or 
compatibles, one of the Macintosh ™ family of PCs, Sun Sparcstation ™, a 
workstation or the like. The foregoing are merely exemplary of the types of 
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computers with which the embodiments of the invention may be practiced. Typically, 
the processes of the embodiments, described hereinafter, are resident as software or a 
program recorded on a hard disk drive (generally depicted as block 712 in Fig. 7) as 
the computer readable medium, and read and controlled using the processor 704. 
Intermediate storage of the program and pixel data and any data fetched from the 
network may be accomplished using the semiconductor memory 706, possibly in 
concert with the hard disk drive 712. 

In some instances, the program may be supplied to the user encoded on a CD-ROM or 
a floppy disk (both generally depicted by block 712), or alternatively could be read by 
the user from the network via a modem device connected to the computer, for 
example. Still further, the software can also be loaded into the computer system 700 
from other computer readable medium including magnetic tape, a ROM or integrated 
circuit, a magneto-optical disk, a radio or infra-red transmission channel between the 
computer and another device, a computer readable card such as a PCMCIA card, and 
the Internet and Intranets including email transmissions and information recorded on 
websites and the like. The foregoing are merely exemplary of relevant computer 
readable mediums. Other computer readable mediums may be practiced without 
departing from the scope and spirit of the invention. 

Thus, a method, an apparatus, and a computer program product for remote 
authentication based on exchanging signals representing biometrics information and 
establishing a cryptographic key have been described. While only a small number of 
embodiments are described, it will be apparent to those skilled in the art, in view of 
this disclosure, that numerous changes and/or modifications can be made without 
departing from the scope and spirit of the invention. 
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CLAIMS 

1 . A method of authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said method including the steps of: 

generating a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

generating a random number x, computing g* modulo p, where g and p are 
numbers, deriving a key k A from g* modulo p, encrypting said first challenge signal 
with k A and a symmetric key cryptosystem, and sending a first ciphertext to said 
remote party; 

receiving a second ciphertext from said remote party, sending g* modulo p to 
said remote party, and starting a clock; 

receiving a third ciphertext and g y modulo p from said remote party, stopping 
the clock, and computing an elapsed time interval of said clock; 

deriving a key from g y modulo p, computing g xy modulo p, deriving a key 
k^g from g* y modulo p, decrypting said second ciphertext with k^ to recover a second 
challenge signal from said remote party, decrypting said third ciphertext to recover a 
first response signal fi-om said remote party; 

verifying that said elapsed time of the clock is within a predetermined interval 
(TL A , TU A ), where TL A and TU A are positive numbers; 

verifying that said second challenge signal is produced by said remote party; 

producing a second response signal of minimum duration T, encrypting said 
second response signal with k^ and sending a fourth ciphertext to said remote party; 

verifying that said first response signal is a response produced by said remote 
party to said first challenge signal; and 

generating a key k from g* y modulo p for secure communications with said 
remote party. 

2. The method according to claim 1, wherein T is larger than the ch annel 
transmission and processing delay. 
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3. The method according to claim 1, wherein said challenge signals and 
response signals represent biometrics characteristics of the producing parties. 

4. The method according to claim 3, wherein said biometrics 
characteristics include the voice of a person. 

5. The method according to claim 1 or 3, wherein verification of said first 
response signal and said second challenge signal from said remote party is based on 
familiarity of remote party's biometrics characteristics. 

6. The method according to claim 1, wherein encryption of said challenge 
and response signals is performed using a cryptographic commitment function. 

7. The method according to claim 1, where TL A is tj + tj and TU A is t, + ^ 
+ T, with t t being the duration of said first challenge signal and being the duration 
of said first response signal. 

8. An apparatus for authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said apparatus including: 

means for generating a first challenge signal of minimum duration T, where T 
is a fixed time interval; 

means for generating a random number x, computing g* modulo p, where g 
and p are numbers, deriving a key k A from g* modulo p, encrypting said first challenge 
signal with k A and a symmetric key cryptosystem, and sending a first ciphertext to 
said remote party; 

means for receiving a second ciphertext from said remote party, sending g* 
modulo p to said remote party, and starting a clock; 

means for receiving a third ciphertext and g y modulo p from said remote party, 
stopping the clock, and computing an elapsed time interval of said clock; 

means for deriving a key k B from g y modulo p, computing g* y modulo p, 
deriving a key k^ from g xy modulo p, decrypting said second ciphertext with k B to 
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recover a second challenge signal from said remote party, decrypting said third 
ciphertext to recover a first response signal from said remote party; 

means for verifying that said elapsed time of the clock is within a 
predetermined interval (TL A , TUa), where TL A and TU A are positive numbers; 

means for verifying that said second challenge signal is produced by said 
remote party; 

means for producing a second response signal of minimum duration T, 
encrypting said second response signal with and sending a fourth ciphertext to 
said remote party; 

means for verifying that said first response signal is a response produced by 
said remote party to said first challenge signal; and 

means for generating a key k from g xy modulo p for secure communications 
with said remote party. 

9. The apparatus according to claim 8, wherein T is larger than the 
channel transmission and processing delay. 

10. The apparatus according to claim 8, wherein said challenge signals and 
response signals represent biometrics characteristics of the producing parties. 

1 1. The apparatus according to claim 10, wherein said biometrics 
characteristics include the voice of a person. 

12. The apparatus according to claim 8 or 10, wherein verification of said 
first response signal and said second challenge signal from said remote party is based 
on familiarity of remote party's biometrics characteristics. 

13. The apparatus according to claim 8, wherein encryption of said 
challenge and response signals is performed using a cryptographic commitment 
function. 
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14. The apparatus according to claim 8, where TL A is tj + ^ and TU A is tj + 
tj + T, with t, being the duration of said first challenge signal and ^ being the duration 
of said first response signal. 

15. A computer program product having a computer usable medium 
having a computer readable program code means embodied therein for authenticating 
a remote party and establishing a cryptographic key for secure communications via an 
insecure communications channel, said computer program product including: 

computer readable program code means for generating a first challenge signal 
of minimum duration T, where T is a fixed time interval; 

computer readable program code means for generating a random number x, 
computing g x modulo p, where g and p are numbers, deriving a key k A from g x modulo 
p, encrypting said first challenge signal with k A and a symmetric key cryptosystem, 
and sending a first ciphertext to said remote party; 

computer readable program code means for receiving a second ciphertext from 
said remote party, sending g x modulo p to said remote party, and starting a clock; 

computer readable program code means for receiving a third ciphertext and g y 
modulo p from said remote party, stopping the clock, and computing an elapsed time 
interval of said clock; 

computer readable program code means for deriving a key k B from g y modulo 
p, computing g xy modulo p, deriving a key k^ from g^ modulo p, decrypting said 
second ciphertext with k B to recover a second challenge signal from said remote party, 
decrypting said third ciphertext to recover a first response signal from said remote 
party; 

computer readable program code means for verifying that said elapsed time of 
the clock is within a predetermined interval (TL A , TUa), where TL A and TU A are 
positive numbers; 

computer readable program code means for verifying that said second 
challenge signal is produced by said remote party; 

computer readable program code means for producing a second response 
signal of minimum duration T, encrypting said second response signal with k^ and 
sending a fourth ciphertext to said remote party; 
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computer readable program code means for verifying that said first response 
signal is a response produced by said remote party to said first challenge signal; and 

computer readable program code means for generating a key k from g* y 
modulo p for secure communications with said remote party. 

16. The computer program product according to claim 15, wherein T is 
larger than the channel transmission and processing delay. 

17. The computer program product according to claim 15, wherein said 
challenge signals and response signals represent biometrics characteristics of the 
producing parties. 

18. The computer program product according to claim 17, wherein said 
biometrics characteristics include the voice of a person. 

19. The computer program product according to claim 15 or 17, wherein 
verification of said first response signal and said second challenge signal from said 
remote party is based on familiarity of remote party's biometrics characteristics. 

20. The computer program product according to claim 15, wherein 
encryption of said challenge and response signals is performed using a cryptographic 
commitment function. 

21. The computer program product according to claim 15, where TL A is tj 
+ 1^ and TU A is t, + 1 2 + T, with tj being the duration of said first challenge signal and 

being the duration of said first response signal. 

22. A method of authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said method including the steps of: 

receiving a first ciphertext from said remote party, generating a random 
number y, computing g y modulo p, where g and p are numbers; 
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producing a first challenge signal of a minimum duration T, where T is a fixed 
time interval; 

deriving a key k B from g y modulo p, encrypting said first challenge signal with 
k B and a symmetric key cryptosystem, and sending a second ciphertext to said remote 
party; 

receiving g x modulo p from said remote party, deriving a key k A from and g x 
modulo p, decrypting said first ciphertext to recover a second challenge signal from 
said remote party; 

verifying that said second challenge signal is produced by said remote party, 
producing a first response signal of said minimum duration T; 

computing g xy modulo p, deriving a key k^ from g xy modulo p, encrypting 
said first response signal, sending a third ciphertext and g y modulo p to said remote 
party, and starting a clock; 

receiving a fourth ciphertext, stopping said clock, computing the elapsed time 
of the clock, and decrypting the fourth ciphertext to recover a second response signal 
from said remote party; 

verifying that said elapsed time of said clock is within a predetermined interval 
(TLg, TU B ), where TLg and TU B are positive numbers; 

verifying that said second response signal is a response produced by said 
remote party to said first challenge signal; and 

generating a key k from g xy modulo p for secure communications with the 
remote party. 

23. The method according to claim 22, wherein T is larger than the channel 
transmission and processing delay 

24. The method according to claim 22, wherein said challenge signals and 
response signals are signals representing biometrics characteristics. 

25. The method according to claim 22, wherein verification of said second 
challenge signal and said second response signal from remote party is based on 
familiarity with a remote party's biometrics characteristics. 
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26. The method according to claim 22, wherein encryption of said 
challenge and response signals is carried out by a cryptographic commitment function. 

27. The method according to claim 22, wherein TLq is t 3 + t 4 and TU B is t 3 
+ t 4 + T, with t 3 being the duration of the first challenge signal and t 4 being the 
duration of the second response signal. 

28. An apparatus for authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said apparatus including: 

means for receiving a first ciphertext from said remote party, generating a 
random number y, computing g y modulo p, where g and p are numbers; 

means for producing a first challenge signal of a minimvmi duration T, where 
T is a fixed time interval; 

means for deriving a key k B from g y modulo p, encrypting said first challenge 
signal with k B and a symmetric key cryptosystem, and sending a second ciphertext to 
said remote party; 

means for receiving g* modulo p from said remote party, deriving a key k A 
from and g x modulo p, decrypting said first ciphertext to recover a second challenge 
signal from said remote party; 

means for verifying that said second challenge signal is produced by said 
remote party, producing a first response signal of said minimum duration T; 

means for computing g xy modulo p, deriving a key k^ from g xy modulo p, 
encrypting said first response signal, sending a third ciphertext and g y modulo p to 
said remote party, and starting a clock; 

means for receiving a fourth ciphertext, stopping said clock, computing the 
elapsed time of the clock, and decrypting the fourth ciphertext to recover a second 
response signal from said remote party; 

means for verifying that said elapsed time of said clock is within a 
predetermined interval (TLg, TU B ), where TLq and TU B are positive numbers; 
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means for verifying that said second response signal is a response produced by 
said remote party to said first challenge signal; and 

means for generating a key k from g* y modulo p for secure communications 
with the remote party. 

29. The apparatus according to claim 28, wherein T is larger than the 
channel transmission and processing delay 

30. The apparatus according to claim 28, wherein said challenge signals 
and response signals are signals representing biometrics characteristics. 

3 1 . The apparatus according to claim 28, wherein verification of said 
second challenge signal and said second response signal from remote party is based 
on familiarity with a remote party's biometrics characteristics. 

32. The apparatus according to claim 28, wherein encryption of said 
challenge and response signals is carried out by a cryptographic commitment function. 

33. The apparatus according to claim 28, wherein TLq is t 3 + t 4 and TU B is 
t 3 + t 4 + T, with t 3 being the duration of the first challenge signal and t 4 being the 
duration of the second response signal. 

34. A computer program product having a computer usable medium 
having a computer readable program code means embodied therein for authenticating 
a remote party and establishing a cryptographic key for secure communications via an 
insecure communications channel, said computer program product including: 

computer readable program code means for receiving a first ciphertext from 
said remote party, generating a random number y, computing g y modulo p, where g 
and p are numbers; 

computer readable program code means for producing a first challenge signal 
of a minimum duration T, where T is a fixed time interval; 
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computer readable program code means for deriving a key k B from g y modulo 
p, encrypting said first challenge signal with k B and a symmetric key cryptosystem, 
and sending a second ciphertext to said remote party; 

computer readable program code means for receiving g* modulo p from said 
remote party, deriving a key k A from and g x modulo p, decrypting said first ciphertext 
to recover a second challenge signal from said remote party; 

computer readable program code means for verifying that said second 
challenge signal is produced by said remote party, producing a first response signal of 
said minimum duration T; 

computer readable program code means for computing g xy modulo p, deriving 
a key k^ from g xy modulo p, encrypting said first response signal, sending a third 
ciphertext and g y modulo p to said remote party, and starting a clock; 

computer readable program code means for receiving a fourth ciphertext, 
stopping said clock, computing the elapsed time of the clock, and decrypting the 
fourth ciphertext to recover a second response signal from said remote party; 

computer readable program code means for verifying that said elapsed time of 
said clock is within a predetermined interval (TLb, TU b ), where TLb and TU B are 
positive numbers; 

computer readable program code means for verifying that said second 
response signal is a response produced by said remote party to said first challenge 
signal; and 

computer readable program code means for generating a key k from g* y 
modulo p for secure communications with the remote party. 

35. The computer program product according to claim 34, wherein T is 
larger than the channel transmission and processing delay 

36. The computer program product according to claim 34, wherein said 
challenge signals and response signals are signals representing biometrics 
characteristics. 
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37. The computer program product according to claim 34, wherein 
verification of said second challenge signal and said second response signal from 
remote party is based on familiarity with a remote party's biometrics characteristics. 

38. The computer program product according to claim 34, wherein 
encryption of said challenge and response signals is carried out by a cryptographic 
commitment function. 

39. The computer program product according to claim 34, wherein TLb is 
t 3 + t 4 and TU B is t 3 + t 4 + T, with t 3 being the duration of the first challenge signal and 
t 4 being the duration of the second response signal. 

40. A method of authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said method including the steps of: 

generating a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

generating a random number x, computing g* modulo p, where g and p are 
numbers, deriving a key k A from g* modulo p, encrypting said first challenge signal 
with k A and a symmetric key cryptosystem, and sending a first ciphertext to said 
remote party; 

receiving a second ciphertext, sending g* modulo p to said remote party, and 
starting a clock; 

receiving g y modulo p, computing a key k B from g y modulo p, decrypting the 
second ciphertext to recover a second challenge signal from said remote party; 

verifying said second challenge statement to ensure that said second challenge 
statement is produced by said remote party, and producing a first response signal of 
minimum duration T; 

computing g xy modulo p, deriving a key k^ from g xy modulo p, encrypting 
said first response signal and sending a third ciphertext to said remote party; 
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receiving a fourth ciphertext from said remote party, stopping said clock, 
decrypting the fourth ciphertext with to recover a second response signal from 
said remote party; 

verifying that said elapsed time of said clock is within a predetermined interval 
(tl A , tuj, where tl A and tu A are positive numbers; 

verifying that said second response signal is a response produced by said 
remote party to said first challenge signal; and 

generating a key k from g xy modulo p for secure communications with said 
remote party. 

41 . The method according to claim 40, wherein T is larger than the channel 
transmission and processing delay 

42. The method according to claim 40, wherein said challenge signals and 
response signals are signals representing biometrics characteristics. 

43. The method according to claims 40, wherein verification of said 
second response signal and said second challenge signal from remote party is based 
on familiarity of remote party's biometrics characteristics. 

44. The method according to claim 40, wherein encryption of said 
challenge and response signals is carried out by a cryptographic commitment function. 

45. The method according to claim 40, where tl A is T l + T 2 and tu A is T, + 
T 2 + T, with Tj being the duration of said first challenge signal and T 2 being the 
duration of said second response signal. 

46. An apparatus for authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said apparatus including: 

means for generating a first challenge signal of minimum duration T, where T 
is a fixed time interval; 
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means for generating a random number x, computing g* modulo p, where g 
and p are numbers, deriving a key k A from g* modulo p, encrypting said first challenge 
signal with k A and a symmetric key cryptosystem, and sending a first ciphertext to 
said remote party; 

means for receiving a second ciphertext, sending g x modulo p to said remote 
party, and starting a clock; 

means for receiving g y modulo p, computing a key k B from g y modulo p, 
decrypting the second ciphertext to recover a second challenge signal from said 
remote party; 

means for verifying said second challenge statement to ensure that said second 
challenge statement is produced by said remote party, and producing a first response 
signal of minimum duration T; 

means for computing g xy modulo p, deriving a key k^ from g xy modulo p, 
encrypting said first response signal and sending a third ciphertext to said remote 
party; 

means for receiving a fourth ciphertext from said remote party, stopping said 
clock, decrypting the fourth ciphertext with k^ to recover a second response signal 
from said remote party; 

means for verifying that said elapsed time of said clock is within a 
predetermined interval (tl A , tu A ), where tl A and tu A are positive numbers; 

means for verifying that said second response signal is a response produced 
by said remote party to said first challenge signal; and 

means for generating a key k from g xy modulo p for secure communications 
with said remote party. 

47. The apparatus according to claim 46, wherein T is larger than the 
channel transmission and processing delay 

48. The apparatus according to claim 46, wherein said challenge signals 

and response signals are signals representing biometrics characteristics. 
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49. The apparatus according to claims 46, wherein verification of said 
second response signal and said second challenge signal from remote party is based 
on familiarity of remote party's biometrics characteristics. 

50. The apparatus according to claim 46, wherein encryption of said 
challenge and response signals is carried out by a cryptographic commitment function. 

5 1 . The apparatus according to claim 46, where tl A is T, + T 2 and tu A is T, 
+ T 2 + T, with T, being the duration of said first challenge signal and T 2 being the 
duration of said second response signal. 

52. A computer program product having a computer usable medium 
having a computer readable program code means embodied therein for authenticating 
a remote party and establishing a cryptographic key for secure communications via an 
insecure communications channel, said computer program product including: 

computer readable program code means for generating a first challenge signal 
of minimum duration T, where T is a fixed time interval; 

computer readable program code means for generating a random number x, 
computing g* modulo p, where g and p are numbers, deriving a key k A from g* modulo 
p, encrypting said first challenge signal with k A and a symmetric key cryptosystem, 
and sending a first ciphertext to said remote party; 

computer readable program code means for receiving a second ciphertext, 
sending g* modulo p to said remote party, and starting a clock; 

computer readable program code means for receiving g y modulo p, computing 
a key k B from g y modulo p, decrypting the second ciphertext to recover a second 
challenge signal from said remote party; 

computer readable program code means for verifying said second challenge 
statement to ensure that said second challenge statement is produced by said remote 
party, and producing a first response signal of minimum duration T; 

computer readable program code means for computing g xy modulo p, deriving 
a key k^ from g 3 * modulo p, encrypting said first response signal and sending a third 
ciphertext to said remote party; 
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computer readable program code means for receiving a fourth ciphertext from 
said remote party, stopping said clock, decrypting the fourth ciphertext with to 
recover a second response signal from said remote party; 

computer readable program code means for verifying that said elapsed time of 
said clock is within a predetermined interval (tl A , tu^J, where tl A and tu A are positive 
numbers; 

computer readable program code means for verifying that said second 
response signal is a response produced by said remote party to said first challenge 
signal; and 

computer readable program code means for generating a key k from g* y 
modulo p for secure communications with said remote party. 

53. The computer program product according to claim 52, wherein T is 
larger than the channel transmission and processing delay 

54. The computer program product according to claim 52, wherein said 
challenge signals and response signals are signals representing biometrics 
characteristics. 

55. The computer program product according to claims 52, wherein 
verification of said second response signal and said second challenge signal from 
remote party is based on familiarity of remote party's biometrics characteristics. 

56. The computer program product according to claim 52, wherein 
encryption of said challenge and response signals is carried out by a cryptographic 
commitment function. 

57. The computer program product according to claim 52, where tl A is T, + 
T 2 and tu A is Tj + T 2 + T, with T t being the duration of said first challenge signal and 
T 2 being the duration of said second response signal. 



-40- 

58. A method of authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said method including the steps of: 

receiving a first ciphertext from remote party, generating a random number y, 
and computing g y modulo p, where g and p are numbers; 

producing a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

deriving a key kjj from g y modulo p, encrypting said first challenge signal with 
k B and a symmetric key cryptosystem, and sending a second ciphertext; 

receiving g* modulo p, computing a key k A from g* modulo p, decrypting said 
first ciphertext to recover a second challenge signal from remote party, sending g y to 
remote party and starting a clock; 

verifying said second challenge statement to make sure that said second 
challenge statement is produced by said remote party, and then producing a first 
response signal of minimum duration T; 

computing g* y modulo p, deriving a key k^ from g xy modulo p, encrypting 
said first response signal and sending a third ciphertext to said remote party; 

receiving a fourth ciphertext from said remote party, stopping said clock, 
decrypting said fourth ciphertext with k^ to recover a second response signal from 
said remote party; 

verifying that said elapsed time of the clock is within an interval (tl B , tu B ), 
where tl B and tu B are positive numbers; 

verifying that said second response signal a response produced by said remote 
party to said first challenge signal; and 

generating a key k from g xy modulo p for secure communications with the 
remote party. 

59. The method according to claim 58, wherein T is larger than the channel 
transmissionand processing delay 

60. The method according to claim 58, wherein said challenge signals and 
response signals are signals representing biometrics characteristics. 
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61. The method according to claim 58, wherein verification of said second 
challenge signal and said second response signal from said remote party is based on 
familiarity of said remote party's biometrics characteristics. 

62. The method according to claim 58, wherein encryption of challenge 
and response signals are carried out by a cryptographic commitment function. 

63. The method according to claim 58, where tl B is T 3 + T 4 and tu B is T 3 + 
T 4 + T, with T 3 being the duration of said first challenge signal and T 4 being said 
duration of said second response signal. 

64. An apparatus for authenticating a remote party and establishing a 
cryptographic key for secure communications via an insecure communications 
channel, said apparatus: 

means for receiving a first ciphertext from remote party, generating a random 
number y, and computing g y modulo p, where g and p are numbers; 

means for producing a first challenge signal of minimum duration T, where T 
is a fixed time interval; 

means for deriving a key k B from g y modulo p, encrypting said first challenge 
signal with k B and a symmetric key cryptosystem, and sending a second ciphertext; 

means for receiving g* modulo p, computing a key k A from g* modulo p, 
decrypting said first ciphertext to recover a second challenge signal from remote 
party, sending g y to remote party and starting a clock; 

means for verifying said second challenge statement to make sure that said 
second challenge statement is produced by said remote party, and then producing a 
first response signal of minimum duration T; 

means for computing g* y modulo p, deriving a key k^ from g* y modulo p, 
encrypting said first response signal and sending a third ciphertext to said remote 
party; 
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means for receiving a fourth ciphertext from said remote party, stopping said 
clock, decrypting said fourth ciphertext with to recover a second response signal 
from said remote party; 

means for verifying that said elapsed time of the clock is within an interval 
(tl B , tu B ), where tl B and tu B are positive numbers; 

means for verifying that said second response signal a response produced by 
said remote party to said first challenge signal; and 

means for generating a key k from g xy modulo p for secure communications 
with the remote party. 

65. The apparatus according to claim 64, wherein T is larger than the 
channel transmission and processing delay 

66. The apparatus according to claim 64, wherein said challenge signals 
and response signals are signals representing biometrics characteristics. 

67. The apparatus according to claim 64, wherein verification of said 
second challenge signal and said second response signal from said remote party is 
based on familiarity of said remote party's biometrics characteristics. 

67. The apparatus according to claim 64, wherein encryption of challenge 
and response signals are carried out by a cryptographic commitment function. 

68. The apparatus according to claim 64, where tl B is T 3 + T 4 and tu B is T 3 
+ T 4 + T, with T 3 being the duration of said first challenge signal and T 4 being said 
duration of said second response signal. 

69. A computer program product having a computer usable medium 
having a computer readable program code means embodied therein for authenticating 
a remote party and establishing a cryptographic key for secure communications via an 
insecure communications channel, said computer program product: 
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computer readable program code means for receiving a first ciphertext from 
remote party, generating a random number y, and computing g y modulo p, where g 
and p are numbers; 

computer readable program code means for producing a first challenge signal 
of minimum duration T, where T is a fixed time interval; 

computer readable program code means for deriving a key k B from g y modulo 
p, encrypting said first challenge signal with k B and a symmetric key cryptosystern, 
and sending a second ciphertext; 

computer readable program code means for receiving g x modulo p, computing 
a key k A from g* modulo p, decrypting said first ciphertext to recover a second 
challenge signal from remote party, sending g y to remote party and starting a clock; 

computer readable program code means for verifying said second challenge 
statement to make sure that said second challenge statement is produced by said 
remote party, and then producing a first response signal of minimum duration T; 

computer readable program code means for computing g* y modulo p, deriving 
a key k^ from g* y modulo p, encrypting said first response signal and sending a third 
ciphertext to said remote party; 

computer readable program code means for receiving a fourth ciphertext from 
said remote party, stopping said clock, decrypting said fourth ciphertext with k^ to 
recover a second response signal from said remote party; 

computer readable program code means for verifying that said elapsed time of 
the clock is within an interval (tl B , tu B ), where tl B and tu B are positive numbers; 

computer readable program code means for verifying that said second 
response signal a response produced by said remote party to said first challenge 
signal; and 

computer readable program code means for generating a key k from g* y 
modulo p for secure communications with the remote party. 

70. The computer program product according to claim 69, wherein T is 
larger than the channel transmission and processing delay 
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71 . The computer program product according to claim 69, wherein said 
challenge signals and response signals are signals representing biometrics 
characteristics. 

72. The computer program product according to claim 69, wherein 
verification of said second challenge signal and said second response signal from said 
remote party is based on familiarity with said remote party's biometrics 
characteristics. 

73. The computer program product according to claim 69, wherein 
encryption of challenge and response signals are carried out by a cryptographic 
commitment function. 

74. The computer program product according to claim 69, where tl B is T 3 + 
T 4 and tu B is T 3 + T 4 + T, with T 3 being the duration of said first challenge signal and 
T 4 being said duration of said second response signal. 
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REMOTE AUTHENTICATION BASED ON EXCHANGING 
SIGNALS REPRESENTING BIOMETRICS INFORMATION 



ABSTRACT 

A method, an apparatus, and a computer program product for remote authentication 
are disclosed. The methods are based on exchanging of signals representing remote 
party's biometrics information (such as acoustic waveforms) and have application in 
secure telephony or video-conferencing communications over open networks. The 
apparatus includes a speech encoding/decoding module (632), a control module (636), 
an encryption/decryption module (640), a key generator (650), a Diffie Hellman key 
exchange system (660), a timer module (670) for measuring time between a 
challenged statement and a corresponding response statement, an input/output (I/O) 
module (680) for transmitting and receiving data via a communications channel 610. 
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